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Abstract. The notion of symmetry is defined in tfie context of Linear and 
Integer Programming. Symmetric linear and integer programs are studied from 
a group theoretical viewpoint. We show that for any linear program there exists 
an optimal solution in the fixed point set of its symmetry group. Using this 
result, we develop an algorithm that allows for reducing the dimension of any 
linear program having a non-trivial group of symmetries. 



1. Introduction 

Order, beauty and perfection - these are the words we typically associate with 
symmetry. Generally, we expect the structure of objects with many symmetries to 
be uniform and regular, thus not too complicated. Therefore, symmetries usually 
are very welcomed in many scientific areas, especially in mathematics. However, in 
integer programming, the reverse seems to be true. In practice, highly symmetric 
integer programs often turn out to be particularly hard to solve. The problem is 
that branch-and-bound or branch-and-cut algorithms, which are commonly used 
to solve integer programs, work efficiently only if the bulk of the branches of the 
search tree can be pruned. Since symmetry in integer programs usually entails 
many equivalent solutions, the branches belonging to these solutions cannot be 
pruned, which leads to a very poor performance of the algorithm. 

Only in the last few years first efforts were made to tackle this irritating problem. 
In 2002, Margot presented an algorithm that cuts feasible integer points without 
changing the optimal value of the problem, compare [6j. Improvements and general- 
izations of this basic idea can be found in [Tj H] . In |3J |TU] , Linderoth et al. concen- 
trate on improving branching methods for packing and covering integer problems by 
using information about the symmetries of the integer programs. Another interest- 
ing approach to these kind of problems has been developed by Kaibel and Pfetsch. 
In [^, the authors introduce special polyhedra, called orbitopes, which they use 
in |1] to remove redundant branches of the search tree. Friedman's fundamental 
domains in [2] are also aimed at avoiding the evaluation of redundant solutions. 
For selected integer programs like generalized bin-packing problems there exists a 
completely different idea how to deal with symmetries, see e.g. ^j. Instead of elim- 
inating the effects of symmetry during the branch-and-bound process, the authors 
exclude symmetry already in the formulation of the problem by choosing an appro- 
priate representation for feasible packings. 
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All ideas in the aforementioned papers finally rely on the branch- and-bound 
algorithm, or they are only applicable to selected problems. In contrast to this 
optimizational or specialized point of view, we want to approach the topic from a 
more general and algebraic angle and detach ourselves from the classical optimiza- 
tion methods like branch-and-bound. In this paper we will examine symmetries of 
linear programs in their natural environment, the field of group theory. Our main 
objective aims at a better understanding of the role of symmetry in the context of 
linear and integer programming. In a subsequent paper we will discuss symmetries 
of integer programs. 



2. Preliminaries 

Optimization problems whose solutions must satisfy several constraints are called 
restricted optimization problems. If all constraints as well as the objective function 
are linear, we call them linear programs, LP for short. 

The linearity of such problems suggests the following canonical formulation for 
arbitrary LP problems. 

max c'a; 

^^-^ s.t. Ax<b, xeW , 

where A e M™^", b e M™ and c G E" \ {0}. We are especially interested in points 
that are candidates for solutions of an LP. 

Definition. A point x G M" is feasible for an LP if x satisfies all constraints of 
the LP. The LP itself and any set of points is feasible if it has at least one feasible 
point. 

Hence, the set of feasible points X of ([T]) is given by 

X --{xe M" \Ax<b} . 

Convention. We call X the feasible region, c the utility vector and ri the dimension 
of A. The map x i— > c*a; is called the utility function, and the value of the utility 
function with respect to a specific x € M" is called the utility value of x. 

We can interpret the feasible region of an LP in a geometric sense. The following 
definition is adopted from [TT], p. 87. 

Definition. A polyhedron P C M" is the intersection of finitely many affine half- 
spaces, i.e., 

P {a; e M" I ^a; < 6} , 
for a matrix A g M™><" and a vector b e M'". 

Note that every row of the system Ax < b defines an affine half-space. Obviously, 
the set AT is a polyhedron. Since every affine half-space is convex, the intersection 
of affine half-spaces - hence, any polyhedron - is convex as well. Therefore, we can 
now state the convexity of X. 

Remark 1. The feasible region of an LP is convex. 

Whenever we consider linear programs, we are particularly interested in points 
with maximal utility values that satisfy all the constraints. 



SYMMETRIES IN LINEAR AND INTEGER PROGRAMS 



3 



Definition. A solution of an LP is an element x* G K" that is feasible and maxi- 
mizes the utility function. 

If we additionally insist on integrality of the solution, we get a so-called integer 
program, IP for short. According to the LP formulation in (jlj, the appropriate 
formulation for the related IP is given by 

max c*x 

(2) 

s.t. Ax<b, 2; G Z" , 

where A e M™><", b e M" and c e M" \ {0}. 

Analogously, the set of feasible points Xj of ^ is given by 

Xi {x eW\Ax <b, X e Z"} = X n Z" . 

We now want to clarify the meaning of the term symmetry in the context of linear 
and integer programming. 



3. Symmetries 

In general, symmetries are automorphisms, that is, operations that map an ob- 
ject to itself in a bijective way compatible with its structure. Concerning linear 
and integer programs, we therefore have to consider operations that preserve both 
the utility vector and the inequality system, thus, in particular, the polyhedron 
which is described by the inequality system. By the usage of matrix notation, this 
polyhedron is already embedded in Euclidean space M". This is the point where 
we have to decide whether we want to regard M" as an affine or as a linear space. 
In respect of the algorithms we are going to develop, we follow the general ten- 
dency in the literature and choose the linear perspective for the sake of a simpler 
group structure. Hence, the operations we consider are automorphisms of the linear 
space M", that is, elements of the general linear group GL„(M). Furthermore, it 
is reasonable to restrict the set of possible symmetries even to isometries taking 
into account that the angles and the lengths of the edges of the polyhedron need 
to be preserved. Since the set of all automorphisms of an object always is a group, 
we therefore suggest that the symmetries of a linear or an integer program form a 
subgroup of the orthogonal group 0„(M). 

In general, a linear program and the associated integer program need not have the 
same symmetries. The following two examples illustrate this fact. 

Since we are forced to rely on the linear description of an integer program to 
gain information about its symmetries, we want to make sure that any symmetry 
of a linear program is a symmetry of the associated integer program as well. As 
integer programs are naturally confined to the standard lattice Z", we only con- 
sider orthogonal operations that leave the lattice invariant. In particular, such an 
operation represented by a matrix M e 0„(M) maps any standard basis vector Cj 
to an integer vector 

Mcj = (toij, . . . , m„-,)* e Z" , 

which is the j-th column of the matrix M. Hence, all columns of M have to be 
integral, that is, 

A/eO„(M)nGL„(Z) , 
where GL„(Z) is the group of all integrally invertible matrices. 
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Figure 1 . The symmetries of the hnear and the associated integer 
program do not coincide 



Notation. The group of all orthogonal matrices with integral entries 

0„(M)nGL„(Z) <0„(M) 

is denoted by 0„(Z). 

Note that orthogonal matrices with integral entries always are integrally invert- 
ible. We want to learn more about 0„(Z). Since any map M € 0„(Z) preserves 
the distance, the column (mij, . . . , mnj)* is an integer vector of length 1, thus 

(mij, . . . ,TO„j)* € {±ei 1 1 < i < n} 

for I < j < n. Therefore, the set 0„(Z) only consists of signed permutation matri- 
ces. In fact, since every signed permutation matrix is orthogonal and integral, the 
set of signed permutation matrices is equal to 0„(Z). Apparently, the group 0„(Z) 
acts on the set 

{Rei \ l <i<n} . 

The kernel of this action is the group of sign-changes denoted by I?sign- Hence, the 
group Psign is a normal subgroup of On(Z). Furthermore, any element of 0„(Z), 
that is, any signed permutation matrix M has a unique representation M = DP, 
where D is a sign-changing matrix, thus a diagonal matrix with entries ±1 on its 
diagonal, and P is a permutation matrix. Therefore, we have 

0„(Z) = V^ignVn, 

where Vn < 0„(Z) denotes the subgroup of all (n x n)-permutation matrices. 
Since Psign and Vn intersect trivially, we finally conclude that 0„ (Z) splits over I?sign 

Remark 2. The group 0„(Z) is the semidirect product 

0„(Z) = I?,,g„x7'„ . 

In the literature, the group 0„(Z) appears in the context of finite reflection 
groups. More precisely, the group 0„(Z) is the Coxeter group i3„ of rank n, com- 
pare [3], p. 5. 

Due to the invariance of the standard lattice Z" under 0„(Z), the elements of 0„(Z) 
have the potential to satisfy the strict requirements we made on symmetries. That 



SYMMETRIES IN LINEAR AND INTEGER PROGRAMS 



5 



is, elements of 0„(Z) that preserve an LP, i.e., its inequality system and its utility 
vector, also leave the associated IP invariant. 

Remark 3. The invariance of an LP under an element o/0„(Z) implies the in- 
variance of the related IP under the same element. 

However, the reverse does not hold in general, since we can always add asymmet- 
ric cuts to an LP without affecting the set of feasible points of the corresponding 
IP, compare Figure lb. We could now define symmetries of linear and integer 
programs as elements of 0„(Z) that leave invariant the inequality system and the 
utility vector of the problem. But if we take into account the usual linear and inte- 
ger programming constraint x G R>o, which forces non-negativity of the solutions, 
the set of possible symmetries shrinks from 0„(Z) to the group of permutation 
matrices Vn < 0„(Z). 

Now, how should we imagine the action of a symmetry group G <Pn on a linear 
or an integer program? Since G is an automorphism group of M" , its elements per- 
mute the standard basis vectors ei, . . . , e„. Considering the bijective G-equivariant 
mapping 

!■ i, I < i < n , 

the permutation of the subscripts of the basic vectors is an action of a group G" < S„ 
on the set of indices {!,..., n} which is isomorphic to the action of G on the 
standard basis. Hence, we can always think of symmetry groups of linear or integer 
programs as subgroups of S„. 

Remark 4. A group G < S„ acts on the linear space M" via the G-equivariant 
mapping 

/3 : {1, . . . ,n} ^ B : i ^ a , 
where B is the set of the standard basis vectors ei, . . . , e„ o/M". 

Due to Remark [3] we are able to formulate the definition of symmetries of linear 
programs and the corresponding integer programs simultaneously. Consider an LP 
of the form 

max c*x 

s.t. Ax<b, xe M>o , 
and the corresponding IP given by 

(4) 

^ ^ s.t. Ax<b, x& M'-^o, a; e Z" , 

where A e K™><", 6 e E™ and c € M" \ {0}. Note that the LP ^ and the IP Q 
have the additional constraint x <E ]R"q. 

Notation. An LP of the form ^ is denoted by A. 

Apparently, applying a permutation to the matrix A according to Remark [4] 
translates into permuting the columns of A. Since the ordering of the inequali- 
ties does not affect the object they describe, we need to allow for arbitrary row 
permutations of the matrix A. The following definition takes these thoughts into 
account. 



max c^x 
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Definition. A symmetry of a matrix A e jj™^" jg an element g £ S„ such that 
there exists a row permutation a G S,„ with 

P„APg =^ A , 

where and Pg are the permutation matrices corresponding to a and g. The full 
symmetry group of a matrix A G is given by 

{.geS„|3(TeS„, : P^APg = A) . 

A symmetry of a linear inequality system Ax < b, where A € ]R™^", and b G M™, 
is a symmetry g G S„ of the matrix A via a row permutation a G S„i which satis- 
fies b'^ = b. 

A symmetry of an LP A or its corresponding IP is a symmetry of the hnear in- 
equahty system Ax < b that leaves the utility vector c invariant. The full symmetry 
group of A and the corresponding IP is given by 

{5 G S„ I c9 = c, 3(7 G S„, : {b" ^b A P.APg = A)} . 

This is a definition of symmetry as it can be found in literature as well, see e.g. [7]. 

Unfortunately, we cannot predict the effect on the symmetry group in general 
if we add constraints to the inequality system. This is impossible even in the 
special case where the corresponding polyhedron stays unaltered, as we will see in 
Example |10[ However, in some cases we can at least guarantee that the symmetry 
group of the inequality system does not get smaller. 

Theorem 5. Given a symmetry group G < S„ of two inequality systems Ax < b 
and A'x < b', where A G M™^", A' G M"'^", b G M™, and b' G M™', the group G 
also is a symmetry group of the inequality system 

Proof. Let g € G he a, symmetry of Ax < b via the row permutation cr G Sm, and 
a symmetry of A'x < b' via a' G Sm', that is, 

P.APg = A, P^.A'Pg = A', P,b = b, P,,b' - b' . 

Then we get 

P. \ fA\ ^ f PrrAPa \ [A 



and 



P.,)\A')^3 (p^,A'Pgl \A' 



Pa 0\ (b\ _ f P,b\ _ fb 

PaJ [y - [p^,b' - [y 



Hence, the permutation 5 is a symmetry of the inequality system 

a) ^ ^ {b' 

via the row permutation {^^ P ,) ^ ^{m+rn')x{m+m') ^ □ 
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4. Orbits 

The following basic terms, notations and first insights into actions of symmetry 
groups on linear programs will turn out to be useful. 

Definition. Given a group G < S„ and an element x £ M", the orbit x'^ of x with 
respect to G is defined by 

a;G {3.9 I g e G} . 

If G is the symmetry group of an LP with the feasible region X, the group G 
leaves X invariant. Hence, a point x is feasible if and only if all elements of x'^ are 
feasible as well. 

Remark 6. Given a symmetry group G < Sn of an LP A, a point x is feasible 
for A if and only if every element of the orbit x^ is feasible for A. 

The following theorem states that applying symmetries does not change the value 
of the utility function. 

Theorem 7. Let G < S„ &e a symmetry group of an LP A. Given x G M" the 
utility function of A is constant on the orbit x'^ . 

Proof. By definition, every symmetry g £ G fixes the utility vector c. Therefore, 
we have 

c*a;^ = (c^)*x^ = ^(c^)i(a;^)j = ^ CjgXjs = ^ CjsXis = c*x 

for every element x^ of x'~^. □ 

The orbits of two elements x,x € M" are equal if and only if x and x are 
equivalent, i.e., there exists an element g £ G with x^ = x. 

Acting on the standard basis B := {ei, . . . , e„} of M", the group G splits B into k 
disjoint orbits. 

Notation. An orbit of a group action on B is denoted by O, and the set of all 
orbits is denoted by O. The subspace spanned by an orbit is denoted by V. 

Formulating an LP problem, the variables can be named in an arbitrary way. 
Therefore, we can always assume that the decomposition into orbits is aligned to 
the order of the basis B, in the following sense: 

Remark 8. Without loss of generality, the orbits of G on B are given by 

Oi = {ei, . . . ,e„J , 

Oi — {es-_-^+i, . . . ,esi_i+ni} , 
for i £ {2, . . . , k}, where k is the number of orbits, Ui the number of elements in 
orbit Oi, and Si is defined by Si :— ^j- 

Convention. The corresponding spans of the orbits Oi are denoted by Vi. 

Applying Theorem [7] to a unit vector e^, we get some important information 
about the structure of the utility vector c. 

Corollary 9. Let ei,ej £ B be two elements of the same orbit O under a group G. 
Then the entries Ci and Cj of the utility vector c are equal. 
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Referring to Remark [8] the utility vector c has the following structure: 
c= hi,- - j, 71 : 72 ■ , 72 . ■■■, 1k,- - J, Ik f ■ 

We do not want to suppress the fact that there are other ways to define sym- 
metries of linear programs. Apart from the question whether to consider affine or 
linear transformations, another important subject needs to be put up for discus- 
sion. As already mentioned in the introduction, the original motivation for the 
study of such symmetries was the unnecessarily large size of the branch- and-cut 
trees caused by symmetric solutions sharing the same utility value. Hence, we 
should focus on operations that leave invariant the utility vector and the feasible 
region, which is the polyhedron described by the inequality system of the linear 
program. Now obviously, many different inequality systems give rise to the same 
polyhedron. Therefore, the invariance of an inequality system implies the invariance 
of the polyhedron, but the reverse is not true, as the following example illustrates: 

Example 10. Consider the LP given by the utility vector c — (1, 1)* and the 
inequality system 

Xi + X2 < 2 

xi > 

X2 > . 

Obviously, the permutation g = (1 2) e S2 is a symmetry of the inequality system, 
thus a symmetry of the feasible region. If we add the redundant constraint 

xi < 2 , 

the new inequality system describes the same polyhedron. Hence g still is a sym- 
metry of the feasible region, but the inequality system itself does not show any 
symmetry anymore. Adding another redundant constraint 

X2 < 2 , 

we retrieve the original symmetry group of the inequality system, again without 
changing the symmetry group of the feasible region. 

So why did we choose this restrictive definition of symmetries for linear and 
integer programs? The main problem is the lack of apposite descriptions of the 
feasible region. The inequality system is the only source of information in this 
context, and the conversion into a description that provides direct access to the 
symmetries of the feasible region might already be equivalent to solving the problem 
itself. 

5. The Set of Fixed Points 

Symmetries in linear programs do not attract much attention in the literature, 
maybe because they do not influence the performance of standard solving proce- 
dures like the simplex algorithm in a negative way. But even though linear programs 
are solvable in polynomial time, it is always worth looking for generic methods to 
save calculation time. In this section, we will focus on the question how symmetries 
can contribute to this goal. 
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As it will turn out later in this section, the points in M" that are fixed by a 
permutation group G < S„ play the key role in our approach. Hence, we will use 
the first part of this section to tame these points by means of linear algebra. 

Definition. Given a subset Y C E" and a group G < S„ acting on Y, the set of 
fixed points of Y with respect to an element g € G is defined by 

Fix<,(y) ■.= {yeY\yS ^y} . 

Therefore, the set of fixed points of Y with respect to G is given by 

FixG(r) -.^{yeYly^^y for all 5 G G} = f| Fix<,(y) . 

sec 

Recall that a group G < acts on M" as described in Remark |4] thus we inter- 
pret G as a linear group. In terms of linear algebra, the set of fixed points FiXg(M") 
is the eigenspace Eig]^(5) corresponding to the eigenvalue 1. Since FixG(M") is the 
intersection of all of those eigenspaces, the structure of FixG(IR") is not arbitrary. 

Remark 11. The set of fixed points Fix(3(IR") with respect to a group G < S„ is a 
subspace o/M". 

Now let G < S„ be a symmetry group of a linear program A, compare ^. Then 
the utility vector c of the linear program is fixed by every g in G. Since G acts as 
a linear group, the line 

I ■.= {rc\r £ R} 

is pointwise fixed by every g in G, that is, the line I is in FiXg(M") for every g in G, 
thus in the intersection of these sets. 

Remark 12. The line I through the origin spanned by c is a subspace of the set of 
fixed points FiX(3(M"). 

We are particularly interested in the exact dimension of FixG(]R"). By Re- 
mark [12] we already know that FixG(R"') is at least one-dimensional. To determine 
its dimension precisely, we first need to consider the dimension of a certain subspace 
of FixG(M"). 

Lemma 13. Let O be a subset of the standard basis B o/M" and G < S„ a group 
acting transitively on O. Then the intersection FixciV) of the span V :— (O) and 
FixG(M") is determined by 

FixG(y)-(5] e.) . 
In particular, the subspace FixqIV) is one-dimensional. 

Proof. Without loss of generality, let O — {ei, . . . , em}- Since O is invariant under 
G, the vector 

v:^ E e. = (l,...,l,0,...,0)*ey 

is fixed by G, thus 

veVn FixG(M") = FbiaiV) , 
and further (v) C FixQ{V). 

In order to prove the converse inclusion, it suffices to show that the dimension 
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of FixG(V^) is not greater than 1. To this end, we define the (m — l)-dimensional 
subspaces Wj <Vhy 

W, := {0\{e,}) . 

Assume that the dimension of FixG(y) is greater than 1. By the dimension formula, 
we then have 

dim(VFj n FixaiV)) = dimWj + dimFixG(V") - dimV = 
= dimFixG(^^) - 1 > 1 
for every j = 1, . . . , to. In particular, there exists a vector 



0^w:=J2 a^e^ e {W,n H FiXG(l/)) 



with at least one coefficient a; 7^ 0. Since G acts transitively on O, we find an 
element g € G that maps e/ to Cm- Being an element of FixG(V^), the vector w is 
fixed by g. Thus, we can write 



w 

i=l 

contradicting the fact that w G WmCSP\yLG{V) . Consequently, we have dimFixG(F) < 
1, and therefore 

Yi^GiV) = {v) ^ {Y, e.) . 

□ 

By Lemma |13| we are now able to establish a direct relation between the di- 
mension of FixG(IR") and the number of orbits generated by G. For orbits and the 
corresponding spans we use the notation we introduced in Section [4] 

Theorem 14. Let k he the number of orbits of B under G. Then the following 
statements hold: 

i) The set of fixed points with respect to G can be written as 

k 

FixG(M") = 0FixG(F,) . 

1=1 

ii) The set of fixed points FixG(]R") is a subspace ofW^ of dimension k. 

Proof. In both parts of the proof we will use the fact that FixG(K") is a subspace 
of M", which we already know by Remark 11 We start with the proof for the 
special representation of FixG(]R"). 

i) Since the set of orbits O — {Oi, ... ,0k} is a partition of the basis B of R", 
we have 

(5) V, n V, = {0} 

for i ^ j, and further 

k 

M" = V, . 
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Thus, we can write 

FixG(M") =M"nFixG(M") = (^0^*^ nFixG(R") . 

Hence, any point v G FixG(Ili") has a unique representation v — X^iLi^*; 
where Vi € Vi. For this representation, we get for any g (z G 

k k 
i=l i=l 

The uniqueness of the representation impHes that g maps each Vi to a cer- 
tain Vj G Vj of the representation. But since every subspace Vi is invariant 
under G, we get v^ eVi, thus vf — Vi, due to ([5|. Hence, we have proved the 
inclusion 

Cfe \ fe 
n FixG(M") c (y, n FixG(M")) . 
i=l / 1=1 

The converse inclusion is immediate, thus we finally get 

FixG(M") = M" n FixG(M") = ^0 v}jr\ FixG(M") = 

fc k 

= (y, n FixG(M")) = 0FixG(l^.) . 

i=l 2=1 

ii) In order to prove the statement on the dimension of FixG(IR"), we recall that 
G acts transitively on every orbit O;. Therefore, Lemma [13] yields 

dimFixG(V^O = 1 

for all i = l,...,fc. Using i), the dimension of FixG(]R") can therefore be 
computed as 

(fc \ fc 

0FixG(V'O - ^dimFixG(F,) = k . 
i=l / i=l 

□ 

The statement in Theorem [14] is particularly interesting if the group G generates 
only one single orbit. 

Corollary 15. // G acts transitively on the standard basis B , the set of fixed 
points FixG(]R") is one- dimensional. 

We complete our studies on the set of fixed points with a simple example. 



Example 16. Consider the LP given by 
subject to 



XX < 2.5 

X2 < 2.5 
xi + X2 < 3.7 , 
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where 

xi,X2 e M>o . 

Then the LP has the full symmetry group 82- In this special case, the set of 
fixed points Fixs2(Ili'^) coincides with the line I through the origin spanned by the 
utility vector c, compare Remark |12[ The following figure shows the graphical 
representation of the LP. 




Figure 2 . Graphical representation of the LP 



Now we focus on the solutions of the linear program given in Example 16 Ob- 
viously, the point x* is the solution of the LP provided by the simplex algorithm. 
In fact, all points on the bold line parallel to the hyperplane c*x = z are solutions 
of the LP. In particular, this is also true for the intersection point .t^^ e Fixg2(M^). 
Hence, by generalizing Example |16| we get to the assumption that for any n- 
dimensional linear program with full symmetry group G < S„, we can always find 
a solution in the associated set of fixed points FixG(M"). We will check this as- 
sumption in the following section. 

6. Solutions in the Set of Fixed Points 

Before we turn to the main issue, we need to introduce a special representation 
of the barycenter of an orbit, which plays an essential role in our approach. 

Lemma 17. Given x £ M", the barycenter of the orbit x^ can be written as follows: 

Proof. Since the stabilizer G^; is a subgroup of G, we have 

G = U . 

geG 

Let S — {si, . . . , S|a;G|} C G be a set of representatives of the family of cosets Gxg- 
Then 



SYMMETRIES IN LINEAR AND INTEGER PROGRAMS 



13 



Furthermore, the orbit-stabihzer theorem yields the relation 

|a:«| = |G:G.| = M , 

so we get 

Since — x for all g G Gx^ wc have 
and therefore 

Considering the disjoint representation 
of G, we finally obtain 



(as) 



J_ V V t(9") - J- V t9 

|G|Z^Z^^ " |G| ^ 



□ 



The representation of the barycenter provided by Lemma[T7| facilitates the proof 
of the following statement about feasible points in the set of fixed points. 

Theorem 18. Let X be the feasible region of the LP A. If x G M" is feasible for 
A, there exists a feasible point xnx in FixG(E") with the same utility value as x. 

Proof. We define 

Since x^x is the barycenter of x'^ , it belongs to the convex hull of x'-^ . The feasibility 
of the elements of x'-^ , compare Remark rol and the convexity of X now imply that 
Xfix is feasible, too. 



Applying Lemma 17 and the linearity of G, we have 



' yexG / V ' 9GG / ' ' gSG ' ' gGG 



X^ — ^fix 



for all g' £ G. This proves that Xf^^ is a fixed point of G, thus xa^ G FiX(3(IR"). 
By Theorem [7] we already know that 



c*x^ — c*x 



for all g E G, hence 



c a^fix — ^ ] c x^ — y ^ c X — c X . 

' ' see ' ' ssG 

This shows that Xfix has the same utility value as a;. □ 
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The application of Theorem [18] to a solution x* of A leads to a remarkable result. 
The following corollary, which records this result, is of vital importance, since it 
prepares the ground for the algorithm we are going to present subsequent to this 
theoretical part. 

Corollary 19. //A has a solution, there also exists a solution G FixG'(M"). 

In particular, this result shows that the existence of a solution of A implies the 
existence of a solution of A restricted to FiX(3(IR"). Furthermore, the point - 
and therefore every solution of the restricted LP - has the same objective value as 
a solution of A. Consequently, we only need to solve the restricted problem to get 
a solution for the original LP. As we will see in the next section, this kind of rela- 
tionship between two LP problems can be very useful. Therefore, we introduce an 
appropriate partial order ^ on the family of LP problems of dimension n reflecting 
this relationship. 

Definition. Let Ai and A2 be two linear programs of dimension n and ATi, C M" 
the corresponding feasible regions. Then the linear program A2 is less or equal than 
the linear program Ai if the following three conditions are satisfied: 

• The solvability of Ai implies the solvability of A2 . 

• X2 is a subset of Xi. 

• The maximal utility values on the feasible regions Xi and X2 coincide, that 
is, 

maxc*Xi — max 0*^2 . 
In this case, we write A2 ^ Ai. 

Obviously, the relation < is reflexive, asymmetric and transitive, and thus a par- 
tial order. 

Before we turn to practical aspects, we want to direct attention to a special 
property of the result in Corollary [T9j The statement connotes that the symme- 
try of a linear program is always reflected in one of its solutions. This is what 
W. C. Waterhouse calls the Purkiss Principle in his studies on the question: 

Do symmetric problems have symmetric solutions? 

In one of his papers, see [H], he gives a list of concrete examples for this principle, 
but he also shows that this property can not be taken for granted. 

7. Substitutions and Retractions 

In this section our goal is to benefit from the results of the previous section by 
exploiting the ordering of an LP and its restriction to the set of fixed points with 
respect to The following theorem yields a detailed insight into this relation. 

Theorem 20. Let G he a symmetry group of A. Then there exists a matrix P only 
depending on the orbits of G such that the LP 

max c*{Px) 

s.t. A{Px) <b, Px£ Rio, X e FixG(M") 
is less or equal than A with respect to the order ^. 
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Proof. Let O be the set of orbits O of G on Referring to Remark [8} we consider 
the orbits O to be of the form 

Oi = {esi_i+i, ■ ■ ■ ,esi_-^+ni} ■ 

Let Vi be the subspace of M" defined by 

i-l k 

^ - ® Fixam ® . 

j=i 

In order to project every Vi onto FixG(Vi) = (X]j4i 6si_i+j), we define the Hnear 
maps fp, by 

/p. : M" ^'V^ , 

{Ej=i es,_i+j if ^ = Sj-i + 1 
if ; e {si_i +2, . . . ,s,_i + nj 

e; otherwise . 

Hence, the first element of Oi is mapped to the sum of the elements of Oi, while 

the other elements of Oi are mapped to 0. 

The n x n-matrix Pi corresponding to fp. is defined by 

/I ... ON 

P^■■= \ P, \, Pr.= 



/„_ 



n—Si 



\i ... Oy 



where Pi G M"*^"'. According to Theorem |14| we have 

k 

FixG(M") =0FixG(V-) . 

1=1 

Therefore, we are now able to define the map fp : M" FixG(]R") by 

fp{x) = Fx, 

where 

(7) P:^Y[P,^\ •. I ^{B^,...,B,) 



Pi 






















Pk 



By Corollary [19) we know that the restricted LP 
(8) 



max c*x 



s.t. Ax<b, xe M|o> X G FixG(M") . 

is less or equal than A. Since fp is a projection onto FixG(IR"), we have Px = x 
for all X e FixG(ffi"). Hence, the LP ^ is equal to 

max c*{Px) 

s.t. A{Px) <b, Pxe E|o, x e FixG(M") . 

The transitivity of ^ now implies that the LP ([6| is less or equal than A. □ 

Variables of a linear program that are tied together in one orbit are closely 
related. Therefore, we introduce a notation for sets of such variables 
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Notation. Let O be an orbit on the standard basis B of M". The set of variables 
of an LP corresponding to the elements of O is denoted by Xq- 

In order to translate the result of Theorem [20] into an applicable algorithm, we 
perform the so-called substitution procedure computing c* = c*P and A = AP in 
the LP ([6]). According to the definition of P, see Q, the resulting LP is given by 



(9) 

where 



s.t. Ax <b, Pxe M'^o, X e FixG(M") , 



"1 nk 

(10) c* = c*P=(^c„0,...,0, ... ,^c,,_,+„0,...,0)* 

j=i j=i 



and 

A = AP={ABi,...,AB,,...,ABk) . 
Straightforward computation yields 



(11) AB, = A\PA^ 



. .. . 



Furthermore, we have 

Px =(a;i,...,a;i, ... ^Xsf.-i+ii ■ ■ ■ ^^sk-i+i^ 



This representation reveals that - except for the constraint x G FixG(IR") - the 
inequality system of the new LP does not depend on the variables 

for all i e {1, . . . , fc}. Furthermore, the coefficient of the representative Xsi_i+i of 
each set Xo^ accumulates the original coefficients of all variables in Xq; . Therefore, 
we can interpret this procedure as a simultaneous substitution of the elements of 
each Xoi by the representatives Xsi_i+i- 

Notation. The LP that is derived from A by simultaneously substituting the 
elements of each Xoi by the representatives Xsi_i+i^ ^ind adding the constraint 
X e FixG(M") is denoted by Sub(A). 



By Theorem 20 we already know that Sub(A) is less or equal than A. Hence, 
we only need to solve Sub(A) to obtain a solution of A. This fact can be expressed 
in the following way. 

Corollary 21. Every solution o/Sub(A) is a solution of A as well. 

A first application of the substitution procedure to a basic example will shed 
light on the effectiveness and the potential of the algorithm. 
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Example 22. Consider the following LP Ao given by the inequality system Ax < b 
and the utility vector c, where 



/I 1 



1 
\0 1 



0\ 

1 1 

1 

1/ 



2 
3 

V3; 



and 



We can expand this LP to 



(1,1,2,2)* 



and 



Xi 



Xi 



C*X 



X2 



X2 



X3 
X3 



< 1 

X4 < 2 

< 3 

X4 < 3 



Xi+ X2 + 2X3 + 2X4 



Obviously, we can exchange xi and X2 without affecting c or the inequality system 
if we exchange X3 and X4 at the same time. Therefore, this LP has 

G:= ((1 2)(3 4)) 

as a symmetry group, and G divides B into the two orbits Oi — {ei, 62} and O2 = 
{63,64}. 

Applying the substitution procedure to the set of orbits O = {Oi,02}, we obtain 
the new LP Sub(Ao) defined by Ax < b and c, where 

/I 



AP 



1 



1 

yo 1 



\ 














/2 












1 



















2 













1 







1 





1 












1 


0^ 




V 





1 


0/ 



and 



c* = c*F= (1,1, 2, 2) 



/I 
1 
1 
\0 1 

The expanded version of the new LP is given by 






0/ 



(2,0,4,0) 



2x1 



and 



< 1 

2x3 < 2 

xi + X3 < 3 

xi + X3 < 3 

c^x = 2xi + 4x3 , 



where x € FixG(IR"). According to Corollary 21 we only need to solve the LP 
Sub(Ao) which is almost independent of the variables X2 and X4. 
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Note that the LP Sub(Ao) can actually be derived from the original LP by 
substituting xi for X2 and for x^. Furthermore, we observe that we do not use 
any detailed knowledge about the structure of the group G except for the specific 
decomposition of B into orbits. Therefore, we can apply the substitution procedure 
to any LP problem with known orbit decomposition even if we do not have any 
additional information about the group structure of the symmetry group G of the 
linear program. 

Remark 23. Regarding Theorem \2C\ and the substitution procedure, the assumption 
of having a certain group G can he relaxed to the assumption of having a certain 
orbit decomposition. 

Except for the constraint x G FiX(3(K"), the LP Sub(A) is completely indepen- 
dent of certain variables. Therefore, we now focus on a reduction of the dimension 
of the LP. This reduction can be realized by a certain operator, which we are now 
going to introduce. 

Definition. Given an LP A with the set of orbits O = {Oi, ■ ■ ■ , Ok}, the retraction 
r is defined by r : Sub (A) i— > A', where 



A' = 



and Mr G M"""" is defined by 



max c^MrH 



s.t. AMrV <b, ye M^o 



Mr = {vi,...,Vk), v,= J2 ej = (0,...,0,1,...,1,0,...,0)' . 

The LP A' is called the retract of Sub(A), and we denote A' by Ret(Sub(A)). 

Note that in contrast to the n-dimensional LP Sub(A), the dimension of Ret(Sub(A)) 
is equal to the number of orbits, which coincides with the dimension of the set of 



fixed points FixG(M"), see Theorem 14 



Remark 24. Given a linear program A with the set of orbits O = {Oi, . . . ,Ok\) 
the retract Ret(Sub(A)) o/Sub(A) is a linear program of dim,ension k. 

To justify the term retraction, we introduce an appropriate inclusion t satisfying 

r o i id . 



Definition. The inclusion l is defined by t : Ret(Sub(A)) i— > A", where 
A" 



max c^MrM^x 



s.t. AMrM.x <b, x€ M|o> ^ G FixG(M") 

and M, e M''''" is defined by 

Aft = (ei,esi+i,...,esfc_^+i)* . 

The retraction r applied to the LP Sub(A) eliminates the zeros in the represen- 
tations (10 1 and (11) of c and A. Conversely, the inclusion t reintroduces these 
zeros in the following sense: 
Obviously, can be written as 

M, = (Ci,...,Cfc), Q = (e,,0,...,0)eE'=><"' , 
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where is the i-th unit vector in M'^. Referring to the block representation P = 
{Bi, . . . , given in ([t]), we have 

for every j S {1, . . . , fc}, and therefore 

(12) MrM, = {MrCi,..., MrCk) = (Bi , . . . , Bfc) - P . 

Using the property PP = P of the projection matrix P, we finally get 
c^MrM.x = c*Px c'PPx = c*Pa; = c*a; 

and 

iM^M.x = APx = APPa; = APx = Ax . 

This shows that A" = Sub(A), and thus 

(r o t)(Ret(Sub(A))) = r(Sub(A)) = Ret(Sub(A)) . 

With respect to this category theoretical property, we now want to show that we 
only need to solve the retract Ret(Sub(A)) of Sub(A). For this, we analyze the 
linear maps 



and 



x^ M,x 



by considering the corresponding matrices and M^. On the one hand, the 
retraction r maps any element of K*^ to an element of Fix(3(K"). On the other hand, 
the map t applied to a vector x G K" picks exactly the representative Xs^.^+i of 
each set Xoi- Concerning the LP problems Sub(A) and Ret(Sub(A)), this behavior 
has the following effect. 

Lemma 25. Let X be the feasible region o/Sub(A). Then the following statements 
hold: 

i) If y is feasible for Ret(Sub(A)), then x := Mry is feasible for Sub(A). 
ii) The feasible region o/Ret(Sub(A)) is given by Y := M^X. 
Hi) The LP problems Sub(A) and Ret(Sub(A)) have the same maximal utility 
value. 

Proof. We will use the statement in |i| to prove ^ , and the representation in |ii| to 
show |ni| . 

i) Let J/ be a feasible point of Ret(Sub(A)). Since y is in M>q and r maps M.^ to 
FixG(M"), the point x = M^y is in M^Jq n FixG(E"). Mor^eover, we have 

Ax = AMry < b , 

that is, the point x is feasible for Sub(A). 

ii) Let X be in X. Then x is feasible for Sub(A), and thus 

2; e MJlonFixG(M") . 

Therefore, we have Px — x and M^x E ]R>q. By the equality MrM^ — P, see 
(12 1, we obtain 

AMr{M,x) = A{MrM,)x = A{Px) ^ Ax<b , 
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that is, Mi^x is feasible for Ret(Sub(A)). Conversely, let y be a feasible point 
of Ret(Sub(A)). According to|i]), the point x = Mj.y is feasible for Sub(A). 
Straightforward computation yields 

M.Mr^h ■ 

Hence, y can be written as 

y = hy = M.Mry = M,x . 

Therefore, any feasible point of Ret(Sub(A)) is in M^X. Conclusively, the set 
Y — M^X defines the feasible region of Ret(Sub(A)). 
iii) Since AT is a subset of FixG(M"), the definition of Y given in|ii]) yields 

MrY = MrM.X = PX = X . 

Therefore, we can write 

maxc*a;= max x — max Airy , 

xeX xeM,.Y yeY 

hence the optimal values of Sub(A) and Ret(Sub(A)) are equal. 

□ 

The relations M^Mr — Ik and MrM^x ~ x for all x e FixG(M") which we used 
in our proof reveal in particular that l and r are bijective and mutually inverse if we 
restrict i to FixG(K"). The following corollary records this interesting relationship. 

Corollary 26. The linear maps 

r : K'' ^ FixG(M"), y ^ NUy 

and 

.|,,^^,,„, :FixG(M")^M^ x^M.x 

are bijective and mutually inverse. 

The following theorem proves that we only need to solve Ret(Sub(A)) instead 
of Sub(A). Furthermore, it provides a method how to regain a solution of Sub(A) 
from a solution of Ret(Sub(A)). 

Theorem 27. Let X be the feasible region o/Sub(A). Given that Sub(A) has a 
solution, we can prove the following statements, 
i) The LP Ret(Sub(A)) has a solution as well. 

ii) Any solution y* o/ Ret(Sub(A)) induces a solution x'^^ :— Mj.y* of the linear 
program Sub(A). 

Proof. The proof essentially relies on Lemma |25[ 
i) Let be a solution of Sub(A). We show that y* defined by 

y* := M,xl^ 

is a solution of Ret(Sub(A)). By part of Lemma 25 the feasible region of 
Ret(Sub(A)) is given by Y := M^X. Since is in X, the point y* is feasible 
for Ret(Sub(A)). According to 25][iii| , we have 

max c*M,.?; = maxc*a; = c'a;^^ = c*Pa::^^ = c*-{MrMi)x*f^^ = 

= c'Mr{M,xl,) = c'Mry* , 
that is, the point y* is a solution of Ret(Sub(A)). 
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ii) Let y* be a solution of Rct(Sub(A)). Using 25 i|, the point — Mry* is 
feasible for Sub(A). By 25|pli ), we now get 



maxc*x = max c* Mr 2/ — c^M^y* 



This shows that = Mry* is a solution of Sub(A). 



□ 



Combining Corollary |2T] and Theorem [25] we conclude that it suffices to solve 
the fc-dimensional retract Ret(Sub(A)), compare Remark 24 in order to obtain a 
solution of Sub(A), which then is a solution of the original n-dimensional linear 
program A. 

Finally, we resume Example [22] to study the effects of the final two steps of the 
algorithm. 



Example 22 (continued). Consider the LP Sub(Ao) defined by 

/2 0\ 
2 
10 10 

yi 1 0/ 

Then the LP Ret(Sub(Ao)) is given by 



c* = (2,0,4,0) 



Ret(Sub(Ao)) 



max c Mry 

s.t. AMry <b, y € 



where 



c^Mr = (2,0,4,0) 



and 



AMr = 



Expanding Ret(Sub(Ao)), we get 



/2 








'\ 








2 





1 





1 





V 





1 


0/ 



2yi 

yi 
yi 



(I o\ 

1 

1 

Vo 1/ 

/I o\ 

1 
1 

vo ij 



< 1 

2y2 < 2 

y2 < 3 
y2 < 3 



(2,4) 



/2 







2 


1 


1 


V 


1/ 



and 



c'Mry = 2yi + 4t/2 



Obviously, this LP can be solved at a glance. The solution is given by 

2/* -(0.5,1)* . 



22 



KATRIN HERR AND RICHARD BODI 



In order to get a solution of Sub(Ao), we multiply y* by Mr- By Theorem 27 the 
point 



xl^ = Mry* = 



/I 






/0.5\ 


1 







0.5 





1 




1 




V 







is a solution of Sub(Ao). Finally, Corollary 21 guarantees that a;^^ is a solution of 
Aq as well. 



In the procedure we developed, we take advantage of symmetries by deriving a 
linear program of smaller dimension, which still contains enough information to ex- 
tract a solution of the original LP. The elaboration of our method revealed that the 
complexity of the derived linear program solely depends on the number of orbits, 
not on the concrete structure of the symmetry group. Therefore, transitivity of the 
symmetry group suffices to obtain the best possible result. 

But even the knowledge about one single symmetry of a linear program already 
effects a reduction of the dimension, since every symmetry generates a symmetry 
group of the linear program and reduces the number of orbits. Sometimes, the 
derived linear program Ret(Sub(A)) shows further symmetries, even if we already 
considered the full symmetry group of the original problem. In that case, we can 
apply the substitution algorithm iteratively. 

Of course, it is not clear how to determine symmetries of arbitrary linear pro- 
grams. But in practice, some of the symmetries already attract attention during 
the construction of the linear programs. For instance, think of the graph-coloring 
problem, where it is obvious that the variables representing the colors can be ex- 
changed. Therefore, the substitution procedure or algorithm should be understood 
not so much as a part of the solving process, but as a pre-processing step in order to 
produce a lower-dimensional linear program. In this respect, it would be reasonable 
to formulate linear programs as symmetric as possible. 
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